Corpus Analysis for Revision-Based Generation of Complex Sentences
Authors
Abstract
The complex sentences of newswire reports contain floating content units that appear to be opportunistically placed where the form of the surrounding text allows. We present a corpus analysis that identified precise semantic and syntactic constraints on where and how such information is realized. The result is a set of revision tools that form the rule base for a report generation system, allowing incremental generation of complex sentences.
Similar resources
Basic Sentence Example
The complex sentences of newswire reports contain floating content units that appear to be opportunistically placed where the form of the surrounding text allows. We present a corpus analysis that identified precise semantic and syntactic constraints on where and how such information is realized. The result is a set of revision tools that form the rule base for a report generation system, allow...
Evaluating the Robustness and Scalability of Revision-Based Natural Language Generation
This paper presents the first quantitative, corpus-based evaluation of the same-domain robustness and scalability of a new revision-based language generation model, in comparison to the traditional single pass pipeline model. Robustness is defined as the proportion of sentences in a given corpus test sample that can be generated using only knowledge structures abstracted from another sample. Scal...
Content Analysis Table of Medical Ethics Book Based on Allport’s Theory of Value System
Introduction: Regular assessment of academic textbooks and revision of teaching methods are critical for making such textbooks more efficient in meeting the needs of the new generation and conveying values to them. Therefore, in line with the necessity of textbook evaluation, this research examined the extent to which the Medical Ethics book named “physicians and ethical considerations” observe...
Evaluating the Portability of Revision Rules for Incremental Summary Generation
This paper presents a quantitative evaluation of the portability to the stock market domain of the revision rule hierarchy used by the system STREAK to incrementally generate newswire sports summaries. The evaluation consists of searching a test corpus of stock market reports for sentence pairs whose (semantic and syntactic) structures respectively match the triggering condition and application...
Generating Newswire Report Leads with Historical Information: a Draft and Revision Approach
In this paper I investigate the issue of providing historical background in computer-generated reports. I first observe that ignoring this issue is the most drastic limitation of existing report generation systems. I then present an empirical corpus analysis of basketball summaries aimed at discovering the specific means by which historical information is conveyed in human-generated reports. This ...